Rank in Wordlist | Frequency | Word |
---|---|---|
1773 | 1 | 4,26-29 |
1774 | 1 | 4,35 |
1775 | 1 | 4,37-38 |
3616 | 1 | monts,fruâts |
3685 | 1 | nissun",cence |
Rank in Wordlist | Frequency | Word |
---|---|---|
4467 | 1 | su(cun |
Rank in Wordlist | Frequency | Word |
---|---|---|
1831 | 1 | Bataion"Martiri |
2634 | 1 | comandant"Mario":"Il |
3685 | 1 | nissun",cence |
Rank in Wordlist | Frequency | Word |
---|---|---|
110 | 18 | ch'al |
330 | 6 | ch'a |
331 | 6 | ch'o |
541 | 4 | ch'e |
1352 | 2 | l'aghe |
1353 | 2 | l'anime |
1354 | 2 | l'at |
1355 | 2 | l'om |
2020 | 1 | L'associazion |
2021 | 1 | L'aut�r |
Rank in Wordlist | Frequency | Word |
---|---|---|
1245 | 2 | e/o |
1769 | 1 | 3/2001 |
1777 | 1 | 44/45 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots